Reducing power consumption in an electronic device

ABSTRACT

Ingress packet processors in a device receive network packets from ingress ports. A crossbar in the device receives, from the ingress packet processors, packet data of the packets and transmits information about the packet data to a plurality of traffic managers in the device. Each traffic manager computes a total amount of packet data to be written to buffers across the plurality of traffic managers, where each traffic manager manages one or more buffers that store packet data. Each traffic manager compares the total amount of packet data to one or more threshold values. Upon determining that the total amount of packet data is equal to or greater than a threshold value, each traffic manager drops a portion of the packet data, and writes a remaining portion of the packet data to the buffers managed by the traffic manager.

BACKGROUND

This specification relates to reducing power consumption in anelectronic device.

Some electronic devices receive network packets through ingress ports ofthe devices, for forwarding through egress ports of the devices. In somecases, an electronic device temporarily stores packet data in buffers,before the packet data are forwarded to packet queues for transmissionthrough the egress ports to external destinations.

SUMMARY

This specification describes packet processing in an electronic devicehaving a number of innovative aspects and related technologies. Theelectronic device includes one or more ingress ports for receivingnetwork traffic as packets, the ingress ports divided into one or moregroups of ingress ports, where each ingress port group is connected toan ingress packet processor. The electronic device also includes one ormore egress ports for transmitting network traffic as packets towardsthe intended destinations, the egress ports divided into one or moregroups of egress ports, where each egress port group is connected to anegress packet processor. An ingress port in this context corresponds toa transmission channel that is associated with a source device, while anegress port corresponds to a transmission channel that is associatedwith a destination device. Different ingress and egress ports areassociated with different transmission channels and the different portsprocess data for different destinations. For each egress packetprocessor, the electronic device includes one or more dedicated packetqueues, which may also referred to as egress queues or egress buffers,in memory for temporarily storing packet data to be transmitted throughegress ports coupled to the egress packet processor.

The electronic device includes one or more traffic managers for managingthe writing of packet data to the packet queues. Each traffic managerincludes and manages one or more memory buffers (also referred to simplyas buffers) to temporarily store packet data that are to be sent topacket queues, for forwarding through an egress port group coupled to anegress packet processor. A traffic manager receives packet data from theingress ports in units of data segments, which are referred to as cells,where a packet is composed of one or more cells. A crossbar, whichinterconnects the ingress packet processors to the traffic managers,broadcasts packet data from the ingress packet processors to all trafficmanagers in parallel. Each traffic manager reads state informationprovided along with each cell of packet data to determine if packet dataincoming in the next cycle should be written to buffers managed by thetraffic manager. By analyzing the state information for all the cellsbroadcast by the crossbar, each packet manager independently computesthe total amount data, e.g., total number of cells, to be written acrossall the traffic managers in the next cycle.

In some implementations, each traffic manager compares the total amountof data to be written in the next cycle to one or more threshold levels.When the amount of data exceeds the one or more threshold levels, thetraffic manager drops—instead of storing in buffers managed by thetraffic manager—cells of some packets that are destined for egress portscorresponding to the buffers managed by the traffic manager. In someimplementations, a traffic manager selectively drops cells of particularpacket types while storing cells of other packet types. In someimplementations, each traffic manager independently determines how manycells to drop, or cells of which packet types to drop, or both.

In some implementations, the electronic device also includes one or moreingress arbiters with traffic shapers, with each ingress arbiterconnecting one of the ingress port groups to the corresponding ingresspacket processor. Each ingress arbiter determines a traffic rate atwhich the corresponding ingress packet processor transmits packet datato the traffic managers. Depending on the traffic rate, thecorresponding traffic shaper generates a bucket of tokens that areallocated to manage the data rate of the corresponding group of ingressports connected to the ingress arbiter. Upon receiving cells of a packetfrom an ingress port, the ingress arbiter determines, by querying itstraffic shaper, whether a token is available for the ingress port group.If a token is available, then the ingress arbiter schedules the receivedcell for forwarding to the corresponding ingress packet processor.However, if a token is not available, then the traffic shaper preventsthe ingress arbiter from scheduling new cells to be forwarded to thecorresponding ingress packet processor, leading to the cells eitherbeing dropped, or temporarily stored in an input buffer until a tokenbecomes available or a timer expires, whichever occurs first.

The subject matter described in this specification can be realized inparticular implementations so as to achieve one or more of the followingadvantages. Managing storage of cells in buffers by the traffic managersby comparison to threshold levels helps to reduce power consumption inthe electronic device. Reduction in the power consumption can berealized both dynamically and statically by allowing each trafficmanager to drop cells of selected traffic types by comparison to one ormore of multiple threshold levels in each cycle. Enabling each trafficmanager to independently determine the total amount of packet data to bereceived and which cells to drop leads to a distributed implementationin the electronic device. Using the traffic shapers and thecorresponding token bucket system to limit packet processing furtherhelps to manage power consumption based on switching.

The details of one or more implementations are set forth in theaccompanying drawings and the description below. Other features,aspects, and advantages will become apparent from the description, thedrawings, and the claims.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates an example electronic device that includes trafficmanagers.

FIG. 2 illustrates an example electronic device that includes trafficmanagers along with one or more ingress arbiters that each includes atraffic shaper.

FIG. 3 illustrates an example process employed by a traffic manager inan electronic device to store packet data in buffers.

FIG. 4 illustrates an example process used by an ingress arbiter tomanage the rate of traffic fed to an ingress packet processor.

Like reference numbers and designations in the various drawings indicatelike elements. Various embodiments shown in the figures are merelyillustrative representations and are not necessarily drawn to scale.

DETAILED DESCRIPTION

FIG. 1 illustrates an example electronic device 100 that includes e.g.Traffic managers 106 a, 106 b and 106 c. The electronic device 100 alsoincludes ingress ports, ingress arbiters to which the ingress ports areconnected, and corresponding ingress packet processors, e.g., ingressports 102 a 1 and 102 a 2 connected to ingress arbiter 103 a, whichreceives packets from ingress ports 102 a 1 and 102 a 2 and forwardsinterleaved packets to ingress packet processor 102 a; ingress ports 102b 1 and 102 b 2 connected to ingress arbiter 103 b, which receivespackets from ingress ports 102 b 1 and 102 b 2 and forwards interleavedpackets to ingress packet processor 102 b; and ingress ports 102 c 1 and102 c 2 connected to ingress arbiter 103 c, which receives packets fromingress ports 102 c 1 and 102 c 2 and forwards interleaved packets toingress packet processor 102 c. A crossbar 104 in the electronic device100 connects the ingress packet processors to the traffic managers. Theingress packet processors operate on packets serially. The electronicdevice 100 also includes egress packet processors 108 a, 108 b and 108c, and corresponding transmit buffers 109 a, 109 b and 109 c,respectively, which connect the egress packet processors to groups ofegress ports. For example, egress packet processor 108 a is connectedthrough transmit buffer 109 a to egress ports 108 a 1 and 108 a 2;egress packet processor 108 b is connected through transmit buffer 109 bto egress ports 108 b 1 and 108 b 2; and egress packet processor 108 cis connected through transmit buffer 109 c to egress ports 108 c 1 and108 c 2.

Each traffic manager includes one or more buffers to store packet data,e.g., cells, that are to be sent either to packet queues (e.g., uponarrival of the last cell for the packet), or to egress packet processors(e.g., in case of cut-through traffic), e.g. traffic manager 106 aincludes buffers 106 a 1 and 106 a 2; traffic manager 106 b includesbuffers 106 b 1 and 106 b 2; and traffic manager 106 c includes buffers106 c 1 and 106 c 2. Traffic manager 106 a forwards cells stored in thebuffers 106 a 1 or 106 a 2, or both, to the egress packet processor 108a to which it is connected, for transmitting through one or more egressports in the group of ports coupled to the egress packet processor 108a, e.g., egress ports 108 a 1 or 108 a 2, or both. Similarly, trafficmanager 106 b forwards cells stored in the buffers 106 b 1 or 106 b 2,or both, to the egress packet processor 108 b, for transmitting throughone or more egress ports, e.g., egress ports 108 b 1 or 108 b 2, orboth, in the group of ports coupled to the egress packet processor 108b; and traffic manager 106 c forwards cells stored in the buffers 106 c1 or 106 c 2, or both, to the egress packet processor 108 c, fortransmitting through one or more egress ports, e.g., egress ports 108 c1 or 108 c 2, or both, in the group of ports coupled to the egresspacket processor 108 c.

In some implementations, the electronic device 100 is a portion of anetwork device e.g., a network switch e.g., an Ethernet switch thatincludes data sources. In some other implementations, the electronicdevice 100 is a network device, e.g., network switch or networkinterface card (NIC), and data sources are external to the networkswitch. In either case, the electronic device 100 performs forwardingoperations on packet data at very high speeds, e.g., potentially on theorder of tens of thousands of bits per second (bps), with highefficiency, e.g., minimum buffering and no buffer overflow in thedevice. In some other implementations, the electronic device 100 is ageneral-purpose processing unit or a storage device.

The traffic manager, ingress packet processors, egress packetprocessors, buffers, packet queues and port groups described in thisspecification can be implemented in a number of technologies. Forexample, a buffer or a packet queue includes components that can beimplemented using combinational logic circuitry, e.g., logic gates,flip-flops and registers. A buffer or a packet queue also includesmemory components that can be implemented using memory chips or befabricated on one integrated circuit with the rest of the transmitbuffer. The traffic manager logic can be implemented as a programmedmicroprocessor, a programmed microcontroller, an application-specificintegrated circuit (ASIC), or a programmed processor core or set ofprocessor cores on a larger integrated circuit. A port group can includephysical ports or logical ports. A port group includes a set ofserializer/deserializer (SERDES) lanes operating at a particular rate,e.g., 10 Gbps, 25 Gbps, 50 Gbps, or 100 Gbps each. A physical port isassociated with one or more SERDES lanes. For example, a 100 Gbps portcan be associated with ten 10 Gbps lanes, four 25 Gbps lanes, two 50Gbps lanes, or one 100 Gbps lane, depending on the underlying SERDEStechnology used. Similar to a physical port, a logical port is alsoassociated with a destination; however, a logical port includes multiplephysical connections to the destination. A logical port can beimplemented as one or more physical ports. A logical port can be boundto one or more aggregate port buffers. A crossbar is a set of wires thatinterconnect the ingress packet processors to traffic managers. TheWires can be entirely internal to the device, span multiple devices, ora hybrid in which some ingress packet processors and traffic managersare connected using internal wires only, while other ingress packetprocessors and traffic managers are connected using wires that spanmultiple devices. This specification will describe operations performedby these and related components in various implementations of thecomponents, and refer to the components as being “configured to” performthe operations. This should be understood to mean that the componentsinclude hardware, firmware, software, circuitry, or a combination ofthem that in operation cause the components to perform the operations.

In some implementations, each group of egress ports is associated withdifferent destinations. In some implementations, a destination isanother electronic device, e.g., another Ethernet switch, or aperipheral device, e.g., a packet processor.

Network traffic packets for various communications channels are receivedat the electronic device 100 through one or more ingress ports 102 a 1,102 a 2, 102 b 1, 102 b 2, 102 c 1 and 102 c 2, for forwarding towardstheir respective destinations through one or more egress ports in theegress ports 108 a 1, 108 a 2, 108 b 1, 108 b 2, 108 c 1 and 108 c 2.While waiting to be transmitted through the egress ports, the cells ofpacket data are temporarily stored in the buffers, before being sent topacket queues corresponding to the target egress ports. The trafficmanagers 106 a, 106 b and 106 c manage storage of the packet data in thebuffers in units of cells.

The ingress packet processors in the electronic device 100, e.g.,ingress packet processors 102 a, 102 b and 102 c, receive packets fromvarious external data sources through the ingress ports, via therespective ingress arbiters that are coupled to respective ingresspacket processors. The crossbar 104 receives packet data from theingress packet processors and broadcasts the cells of packet data to allthe traffic managers in parallel. State information is included witheach cell, which provides information on the target egress port throughwhich the corresponding packet is to be transmitted. Each trafficmanager determines, by reading the state information provided along witheach cell, which cells are to be written to buffers managed by thetraffic manager.

A traffic manager determines that a cell is to be written to a buffermanaged by the traffic manager when the packet corresponding to the cellis destined to be transmitted through an egress port connected to thetraffic manager. For example, traffic manager 106 a receives all cellsbroadcast by the crossbar 104. Upon reading the state informationprovided with each cell, traffic manager 106 a determines that somecells belong to packets that are to be transmitted through egress port108 a 1, or 108 a 2, or both, while the remaining cells correspond topackets that are to be transmitted through egress ports in other portgroups. The traffic manager 106 a accordingly determines that the cellsbelonging to packets that are to be to be transmitted through egressport 108 a 1 or 108 a 2 or both, are to be written to buffers managed bythe traffic manager 106 a, while the remaining cells, which are destinedfor egress ports managed by other traffic managers, can be dropped uponreading their state information to compute the total amount of data tobe written, as described below.

Since the crossbar 104 broadcasts the packet data with state informationto all the traffic managers, each traffic manager determines, uponreading the state information for all the cells, the total amount ofdata to be written, which includes the cells that are to be written tobuffers managed by the traffic manager, and the cells that are to bewritten to buffers managed by other traffic managers, before the cellsare sent either to the egress packet processors, or linked to packetqueues upon arrival of the last cell. For example, each of trafficmanagers 106 a, 106 b and 106 c receives all the packet data cellsbroadcast by the crossbar 104. Traffic manager 106 a determines, byreading the state information from all the cells: (i) which of thesecells are to be written to buffers managed by the traffic manager, e.g.,buffers 106 a 1 or 106 a 2, or both, and (ii) the total amount of datato be written to all the buffers across all the traffic managers, e.g.,buffers 106 a 1, 106 a 2, 106 b 1, 106 b 2, 106 c 1 and 106 c 2.Similarly, each of traffic managers 106 b and 106 c determines, from thestate information provided with all the cells broadcast by the crossbar104: (i) which of these cells are to be written to buffers managed bythe traffic manager, e.g., buffers 106 b 1 or 106 b 2, or both, managedby traffic manager 106 b, or buffers 106 c 1 or 106 c 2, or both,managed by traffic manager 106 c, and (ii) the total amount of data tobe written to all the buffers across all the traffic managers.

Since all the traffic managers receive the same cells that are broadcastby the crossbar 104, each traffic manager gets an identical view of thepacket data. Accordingly, the total amount of data computed by eachtraffic manager is the same. In this manner, each traffic managerindependently determines, using a distributed approach, the total amountof data to be written across all the buffers.

When a traffic manager writes packet data to a buffer, device power isconsumed. The consumption of power can be high, e.g., for high data ratetraffic or a large number of traffic sessions, in which the trafficmanagers write a large amount of cells to the buffers. For example, ifthere are N traffic managers, each writing M cells per clock cycle, thetotal number of write operations in a clock cycle is N×M. N and M areboth integers greater than zero. For a large N, or a large M, or both,the total power consumed in write operations to the buffers can be high,which can be a major portion of the energy expenditure by the electronicdevice 100.

In some implementations, the total energy consumed by the trafficmanagers in writing to the respective buffers is controlled to be withincertain budgeted values by limiting the total number of writes that areperformed across all the traffic managers in a clock cycle. For example,in some implementations, the electronic device 100 is configured suchthat the total number of write operations per clock cycle across alltraffic managers is limited to a value that is set to limit the totalenergy consumed by the traffic managers in performing write operationswithin the budgeted values. In such implementations, each trafficmanager independently determines which cells to drop, as described ingreater detail below.

As noted above, each traffic manager determines, based on theinformation broadcast by the crossbar 104 to all the traffic managers,(i) the cells that are to be written to buffers managed by the trafficmanager, and (ii) the total amount of data to be written to all thebuffers across all the traffic managers. Each traffic manager comparesthe total amount of data to be written across all the traffic managers,represented as a total number of write operations, to a predeterminedthreshold level, e.g., a preselected number of write operations; if thetraffic manager determines that the total amount of data to be writtenexceeds the threshold level, then the traffic manager decides to dropsome of the cells it could write to buffers managed by the trafficmanager.

In some implementations, each traffic manager determines the number ofcells to drop such that the total amount of data to be written acrossall the traffic managers becomes less than the threshold level. In otherimplementations, each traffic manager determines the number of cells todrop such that a prorated share of its contribution to the total amountof data to be written across all the traffic managers becomes less thanthe threshold level. In some implementations, a traffic managerconsiders several factors to determine whether to accept or drop cells.This factors include the type of packet (e.g., unicast, multicast,unicast with additional copies, multicast with additional copies, amongothers); the destination type (e.g., central processing unit (CPU) orfront panel port, among others); the packet source type (e.g., CPU orfront panel port, among others); source port and destination port;current drop state of the packet queue to which the cell will be linked(for unicast traffic); and drop rate from a given source or destination(e.g., based on previous drops to provide fairness).

As an illustrative example, in a clock cycle, each of traffic managers106 a, 106 b and 106 c receives, from the crossbar 104, informationabout cells that are to be written to the buffers across the trafficmanagers, e.g. Traffic manager 106 a determines, by analyzing thisinformation, that the total number of write operations across alltraffic managers 106 a, 106 b and 106 c exceeds the threshold level by12 cells. Each of the other traffic managers 106 b and 106 cindependently determines the same thing. In some implementations, inresponse to such a determination, traffic manager 106 a drops 12 cellsfrom the number of cells that were to be written to the buffers 106 a 1and 106 a 2. Similarly, each of the other traffic managers 106 b and 106c independently drops 12 cells from the respective cells that were to bewritten to their respective buffers. In other implementations, for thisexample, in response to the determination that the total number of writeoperations across all traffic managers exceeds the threshold level by 12cells, traffic manager 106 a decides to drop its prorated share ofcells, i.e., 4 cells, from the number of cells that were to be writtento the buffers 106 a 1 and 106 a 2. Similarly, each of the other trafficmanagers 106 b and 106 c independently decides to drop its respectiveprorated share of cells from the respective cells that were to bewritten to their respective buffers. In these implementations, theprorated share of each traffic manager is the same. In someimplementations, the data are dropped, prior to be written to thebuffers, in the current clock cycle. In some implementations, the datathat are received in the next clock cycle are dropped.

In some cases, the number of cells that were to be written by a trafficmanager to its buffers could be equal to or less than the thresholdlevel. In such cases, the traffic manager drops all the cells that wereto be written to its buffers.

In some implementations, a traffic manager, e.g., any of trafficmanagers 106 a, 106 b and 106 c, drops X consecutive cells, irrespectiveof the traffic type of each cell. In other implementations, a trafficmanager selects the X cells to drop, from the cells that are to bewritten to the buffers managed by the traffic manager, by consideringthe traffic types of the cells. For example, a traffic manager mayreceive cells of different types of traffic, e.g., unicast, CPU traffic,multicast, broadcast, or mirrored, traffic, or a combination of these.The traffic manager may determine to drop broadcast cells first. If thetraffic manager cannot achieve the target of X dropped cells byselecting only broadcast cells, then the traffic manager may selectmulticast cells as well. If the X cells target is still not achieved,then the traffic manager may select cells for mirrored traffic, followedby CPU traffic, and, finally, unicast cells. A different order ofselecting the traffic types is also possible.

In some implementations, the order in which cells are selected using thetraffic type depends on the priority or importance of the traffic types.In the example provided above, unicast, CPU traffic, mirrored traffic,multicast traffic, and broadcast traffic have descending levels ofpriority, in that order. Accordingly, when selecting cells based ontraffic type, the traffic manager first selects cells of the lowestpriority traffic, e.g., broadcast traffic, followed by multicast,mirrored traffic, CPU traffic, and unicast, in that order. If thepriorities of different types of traffic are different, then the orderof selection also may be different. For example, in someimplementations, multicast traffic has higher priority than mirroredtraffic. In such implementations, the traffic manager selects, fordropping, mirrored traffic cells before selecting multicast cells. Inthis manner, in some implementations, to satisfy power constraints, atraffic manager selectively determines what types of traffic to bewritten, and what type of traffic cells to drop: first dropping cells oftraffic types that have lower priority, and progressively selectingcells of higher priority traffic types if needed.

In some implementations, a traffic manager drops cells of a packet, ofwhich some other cells were previously received and stored in a buffer,e.g., when the total amount of packet data to be written was less thanthe threshold. Upon dropping one or more cells of the packet, the packetis considered corrupted. In such implementations, upon dropping one ormore cells of the packet, the traffic manager also removes, from thebuffer, the other cells of the packet that it has stored in previousclock cycles.

In some implementations, a traffic manager checks the total amount ofdata to be written with multiple thresholds, to determine the types ofcells to write to buffers, or to drop. As an example, in someimplementations, traffic managers in the electronic device 100 comparethe total number of cells to be written to three threshold levels—first,second and third thresholds. The first threshold has the lowest value,e.g., 12 cells, and the third threshold has the highest value, e.g., 24cells, with the second threshold being in the middle, e.g., 18 cells, ofthe first and third thresholds in value. The traffic managers receivetraffic of three different types—multicast traffic, CPU traffic andunicast traffic, with multicast traffic having the lowest priority, CPUtraffic having higher priority than multicast traffic, and unicasttraffic having the highest priority.

In the above example implementation, if a traffic manager, e.g., trafficmanager 106 a, determines that the total number of cells to be writtenis equal to or greater than the first (lowest) threshold but less thanthe second threshold, then the traffic manager selects the traffic typewith the lowest priority, e.g., multicast traffic, to drop. Accordingly,prior to the buffer write operations, the traffic manager drops allmulticast cells that were to be written to buffers managed by thetraffic manager. If the traffic manager determines that the total numberof cells to be written is equal to or greater than the second thresholdbut less than the third threshold, then the traffic manager selects, fordropping, the multicast traffic and further selects cells of the traffictype with the next higher priority, e.g., CPU traffic. Accordingly,prior to data being written to the buffers, the traffic manager dropsall multicast cells and all CPU traffic cells that were to be written tobuffers managed by the traffic manager. However, if the traffic managerdetermines that the total number of cells to be written is equal to orgreater than the highest threshold, e.g., the third threshold, then thetraffic manager selects, for dropping, cells of the traffic type withthe highest priority, e.g., unicast cells, in addition to selectinglower priority traffic, e.g., multicast traffic and CPU traffic.

Different numbers of threshold levels, e.g., two, four or fivethresholds, apart from the single threshold or triple threshold levelsdescribed above, can be used in different implementations. The number ofthreshold levels used by the electronic device 100 can correspond to thenumber of traffic types processed by the electronic device. As anexample, in some implementations, the electronic device 100 isconfigured to process five different types of traffic: broadcasttraffic, multicast traffic, mirrored traffic, CPU traffic and unicasttraffic, with ascending values of priority. In such implementations, theelectronic device 100 uses five threshold levels. If a traffic manager,e.g., traffic manager 106 a, determines that the total number of cellsto be written in the next cycle is equal to or greater than the lowestthreshold but less than the next higher threshold, then the trafficmanager selects the traffic type with the lowest priority, e.g.,broadcast traffic, to drop. If the total number of cells in the nextcycle is greater than the second lowest threshold, then the trafficmanager selects for dropping both broadcast traffic and multicasttraffic. If the total number of cells in the next cycle is greater thanthe third threshold, then the traffic manager selects for droppingbroadcast, multicast and mirrored traffic. If the total number of cellsin the next cycle is greater than the fourth or second highestthreshold, then the traffic manager selects for dropping all traffictypes except unicast traffic, e.g., broadcast, multicast, mirrored andCPU traffic. Finally, if the total number of cells in the next cycle isgreater than the fifth or highest threshold, then the traffic managerselects for dropping all traffic types, including unicast traffic.

In some implementations, the types of traffic handled by a trafficmanager, and correspondingly, the number of threshold levels used,correspond to the number of concurrent write operations supported by thetraffic manager. Considering the example above with five thresholdlevels, in some implementations, the traffic manager 106 a supports fiveconcurrent write operations, each for a different type of traffic:broadcast, multicast, mirrored, CPU and unicast traffic. The trafficmanager manages five buffers to support the five write operations: abuffer that is configured to store cells of broadcast packets; a bufferthat is configured to store cells of multicast packets; a buffer that isconfigured to store cells of mirrored traffic packets; a buffer that isconfigured to store cells of CPU traffic packets; and a buffer that isconfigured to store cells of unicast packets. In such cases, when thetraffic manager selects broadcast cells to drop, then prior to writingdata to the broadcast buffer, the traffic manager does not store anycells in the broadcast buffer. Similar operations are performed for theother buffers when cells of other traffic types are selected. Differentpriority values are assigned to the different egress ports, or thebuffers, or both, e.g., depending on the type of traffic serviced by theegress ports and the buffers. For example, an egress port andcorresponding buffer that serve multicast traffic can have a lowerpriority than an egress port and corresponding buffer that serve unicasttraffic.

In the above manner, one or more threshold levels can be enabled indifferent implementations, depending on the power consumption policiesin the electronic device 100 and the types of traffic processed by theelectronic device. Using the different threshold levels, the electronicdevice can dynamically determine what types of traffic to drop to managepower consumption of the electronic device.

In some implementations, one of more of the different threshold levelscan be enabled or disabled depending on the traffic type. For example, atraffic manager, which is configured to use five threshold levels asdescribed by the example above, determines that it will receive only CPUtraffic and unicast traffic. In such cases, the traffic manager comparesthe total number of cells to be written to only the two highestthreshold levels, which correspond to CPU traffic and unicast traffic.However, at a different time, the traffic manager determines that itwill receive broadcast, multicast and unicast traffic. At this time, thetraffic manager compares the total number of cells to the first andsecond threshold levels, e.g., the lowest two thresholds, whichcorrespond to broadcast and multicast traffic, and the fifth thresholdlevel, e.g., the highest threshold, which corresponds to unicasttraffic. In this manner, an electronic device 100 that supports multiplethreshold levels can dynamically determine which threshold levels touse.

In some implementations of the electronic device 100, the one or morethreshold levels can be set by a user, e.g., by an operator whoconfigures the electronic device and sets the threshold levels inaccordance with targeted limits of power consumption by the electronicdevice. In such implementations, the user can change the thresholdlevels, depending on changes in the targeted limits. In someimplementations, the one or more threshold levels are set to constantvalues, e.g., at the time of device manufacture, and cannot be changedby the user.

In some implementations, when the crossbar 104 broadcasts the cells withstate information from the ingress packet processors to the trafficmanagers, different traffic managers can receive the information atdifferent times due to varying propagation delays in the electronicdevice 100. This can be the case, for example, when the physicaldistances from different ingress packet processors to different trafficmanagers are different. For example, the physical distance between theingress packet processor 102 a and the traffic manager 106 a can be lessin the circuit of the electronic device 100, compared to the physicaldistance between the ingress packet processor 102 a and the trafficmanager 106 c. In contrast, the physical distance between the ingresspacket processor 102 c and the traffic manager 106 c can be lesscompared to the physical distance between the ingress packet processor102 c and the traffic manager 106 a.

In such implementations, one or more traffic managers perform operationsto ensure that all traffic managers access the information from thecrossbar 104 at the same time, so that the computation of the totalamount of data to be written at the same time is consistent across alltraffic managers. Upon receiving the data from the crossbar 104, one ormore traffic managers delay reading of the information by time intervalsthat account for the variance in transmission from the ingress packetprocessors to the traffic managers. For example, in some cases, thecrossbar 104 broadcasts, to traffic managers 106 a, 106 b and 106 c,cells with state information from the ingress packet processor 102 a.The traffic manager 106 a receives the information from the crossbar 104earlier than other traffic managers, while traffic manager 106 breceives the information before the traffic manager 106 a. Logic in thetraffic manager 106 a, e.g., filtering logic, computes the transmissiontimes from the ingress packet processor 102 a to the traffic managers106 b and 106 c by analyzing the physical delays in the circuit, anddetermines that the maximum transmission time from the ingress packetprocessor 102 a is to the traffic manager 106 c. Following thisdetermination, traffic manager 106 a computes the additional delay asthe difference between the time that it receives the information fromthe crossbar 104 and the time at which traffic manager 106 c receivesthe same information from the crossbar 104. After receiving theinformation from the crossbar 104, the traffic manager 106 a waits forthis additional delay period before it reads the received information.

Similarly, the traffic manager 106 b independently determines that themaximum transmission time from the ingress packet processor 102 a is tothe traffic manager 106 c. Accordingly, traffic manager 106 b computesits additional delay as the difference between the time that it receivesthe information from the crossbar 104, and the time at which the trafficmanager 106 c receives the same information from the crossbar 104.Traffic manager 106 b waits for this additional delay period before itreads the information received from the crossbar 104.

The traffic manager 106 c also independently determines that the maximumtransmission time from the ingress packet processor 102 a is to itself.Following this determination, upon receiving the information from thecrossbar 104, traffic manager 106 c proceeds to read the informationwithout waiting for any additional time period, since other trafficmanagers have already received the information from the crossbar and areexpected to read the information at this time.

In the above manner, each traffic manager independently determineswhether to delay reading of the state information received from thecrossbar 104, such that all traffic managers read the information at thesame time and thereby compute the total amount of data insynchronization. As described above, different traffic managers maydelay reading the information by different time periods. In the exampleabove, traffic manager 106 a delays reading the information by a longertime period compared to the traffic manager 106 b, since traffic manager106 a receives the information from the crossbar 104 earlier compared totraffic manager 106 b.

In some implementations, in addition, or as an alternative, to reducingpower consumption by controlling storage of cells in buffers, powerconsumption is reduced by controlling the switching operations performedby the electronic device 100. Switching is controlled by limiting therate of packet processing by the ingress packet processors. In someimplementations, the rate of packet processing by an ingress packetprocessor is limited by limiting the packet rate going into the ingresspacket processor, e.g., by traffic shaping using a token bucketmechanism, as described below.

FIG. 2 illustrates an example electronic device 200 that includestraffic managers 202 a, 202 b and 202 c, along with ingress arbiters,e.g., ingress arbiters 203 a, 203 b and 203 c that each includes atraffic shaper, e.g., ingress arbiters 203 a, 203 b and 203 c includetraffic shapers 210 a, 210 b and 210 c respectively. The electronicdevice 200 also includes ingress ports that are connected to the ingressarbiters, e.g., ingress ports 202 a 1 and 202 a 2 connected to ingressarbiter 203 a; ingress ports 202 b 1 and 202 b 2 connected to ingressarbiter 203 b; and ingress ports 202 c 1 and 202 c 2 connected toingress arbiter 203 c. Each traffic shaper regulates packet rate acrossall the ingress ports associated with the corresponding ingress arbiter.A crossbar 204 in the electronic device 200 connects the ingress packetprocessors to the traffic managers. The electronic device 100 alsoincludes egress packet processors 208 a, 208 b and 208 c, andcorresponding transmit buffers 209 a, 209 b and 209 c, respectively,which connect the egress packet processors to groups of egress ports.For example, egress packet processor 208 a is connected through transmitbuffer 209 a to egress ports 208 a 1 and 208 a 2; egress packetprocessor 208 b is connected through transmit buffer 209 b to egressports 208 b 1 and 208 b 2; and egress packet processor 208 c isconnected through transmit buffer 209 c to egress ports 208 c 1 and 208c 2.

Each traffic manager includes one or more buffers to buffer packet data,e.g. traffic manager 206 a includes buffers 206 a 1 and 206 a 2; trafficmanager 206 b includes buffers 206 b 1 and 206 b 2; and traffic manager206 c includes buffers 206 c 1 and 206 c 2. Traffic manager 206 aforwards cells stored in the buffers 206 a 1 or 206 a 2, or both, to apacket queue corresponding to the egress packet processor 208 a, fortransmitting through one or more egress ports in the group of portscoupled to the egress packet processor 208 a, e.g., egress ports 208 a 1or 208 a 2, or both. Similarly, traffic manager 206 b forwards cellsstored in the buffers 206 b 1 or 206 b 2, or both, to a packet queue foregress packet processor 208 b, for transmitting through one or moreegress ports, e.g., egress ports 208 b 1 or 208 b 2, or both, in thegroup of ports coupled to the egress packet processor 208 b; and trafficmanager 206 c forwards cells stored in the buffers 206 c 1 or 206 c 2,or both, to a packet queue for egress packet processor 208 c, fortransmitting through one or more egress ports, e.g., egress ports 208 c1 or 208 c 2, or both, in the group of ports coupled to the egresspacket processor 208 c.

The ingress ports in the electronic device 200, e.g., ingress ports 202a 1, 202 a 2, 202 b 1, 202 b 2, 202 c 1 and 202 c 2, receive packetsfrom various external data sources and forward the packets to therespective ingress arbiters to which the ingress ports are connected.The ingress arbiters 203 a, 203 b and 203 c include traffic shapers,e.g., traffic shapers 210 a, 210 b and 210 c respectively, to shape ormanage the packet data traffic that is sent to the ingress packetprocessors by limiting the packet rate fed to the ingress packetprocessors, e.g., 202 a, 202 b and 202 c respectively, coupled to theingress arbiters, as described in greater detail below. The crossbar 204receives packet data from the ingress packet processors, e.g., in unitsof cells, and broadcasts the packet data to all the traffic managers inparallel. State information is included with each cell, which providesinformation on the target egress port through which the correspondingpacket is to be transmitted. Each traffic manager determines, by readingthe state information provided along with each cell, which cells are tobe written to buffers managed by the traffic manager.

In some implementations, the electronic device 200 is similar to theelectronic device 100 and performs functions that include functionsperformed by the electronic device 100, as described above. In suchimplementations, the crossbar 204 performs functions that includefunctions performed by the crossbar 104, and the traffic managers 206 a,206 b and 206 c perform functions that include functions similar tothose performed by the traffic managers 106 a, 106 b and 106 c describedabove.

In some implementations, there is one ingress arbiter for each group ofingress ports, connecting the ingress port group to the correspondingingress packet processor. For example, ingress arbiter 203 a connectsthe group of ingress ports that include ingress ports 202 a 1 and 202 a2 to the corresponding ingress packet processor 202 a; ingress arbiter203 b connects the group of ingress ports that include ingress ports 202b 1 and 202 b 2 to the corresponding ingress packet processor 202 b; andingress arbiter 203 a connects the group of ingress ports that includeingress ports 202 c 1 and 202 c 2 to the corresponding ingress packetprocessor 202 c. However, in other implementations, there are multipleingress arbiters connecting a group of ingress ports to thecorresponding ingress packet processor, or an ingress arbiter connectsmultiple groups of ingress ports to the respective ingress packetprocessors.

As noted above, the traffic shapers 210 a, 210 b and 210 c are employedin the electronic device 200 to limit the rate at which packet data isfed to the respective ingress packet processors connected to eachingress arbiter. For example, traffic shaper 210 a is used to limit therate at which packet data, received from one or more ingress ports 202 a1 and 202 a 2, is fed to the ingress packet processor 202 a that isconnected to the output of the ingress arbiter 203 a. Similarly, trafficshaper 210 b is used to limit the rate at which packet data, receivedfrom one or more ingress ports 202 b 1 and 202 b 2, is fed to theingress packet processor 202 b that is connected to the output of theingress arbiter 203 b; and traffic shaper 210 c is used to limit therate at which packet data, received from one or more ingress ports 202 c1 and 202 c 2, is fed to the ingress packet processor 202 c that isconnected to the output of the ingress arbiter 203 c. By limiting therate at which packet data is fed to the ingress packet processors, theamount of processing performed by the ingress packet processors islimited. Additionally, the number of cells that are sent to the trafficmanagers per clock cycle is also lowered, which in turn reduces theprobability of cells being dropped by the traffic managers. These havethe effect of reducing the power consumption by the electronic device.Additionally, by reducing the probability of cells being dropped by thetraffic managers, fewer packets are lost by the electronic device 200 asa result of the cell dropping actions described above, leading toimprovements in reliability of the electronic device 200 andimprovements in data transmission.

In some implementations, each of the traffic shapers 210 a, 210 b and210 c is used to limit the rate of data flow to the correspondingingress packet processor by using a token bucket mechanism. In suchimplementations, each traffic shaper starts out with a logical bucketfull of “tokens.” If packets are received from one or more of theingress ports connected to the corresponding ingress arbiter, theingress arbiter consumes a token from the bucket of tokens available toits traffic shaper, when the ingress arbiter forwards a packet, orschedules a packet to be forwarded, to the connected ingress packetprocessor in one clock cycle. A packet is forwarded to the ingresspacket processor only if there are tokens available in the bucket. Ifthere are no tokens available in the bucket, then, in someimplementations, the ingress arbiter drops the packets received from theingress ports, until the bucket is replenished with new tokens. In otherimplementations, the ingress arbiter temporarily stores the additionalpackets in input buffers managed by the ingress arbiter, until tokensbecome available to forward the packets to the ingress packet processor;or the buffer storage time reaches a threshold value, at which time theadditional packets are dropped.

In some implementations, the total number of tokens placed in a bucketcorresponds to the target rate at which packet processing is done by therespective ingress packet processor. For example, ingress packetprocessor 202 a can be configured to process packets at 250 Megahertz(MHz). In such implementations, the traffic shaper 210 a generates up to250 million tokens per second in its bucket. In each of the 250 millionclock cycles in one second, the traffic shaper 210 a allows the ingressarbiter 203 a to consume one token, thereby allowing one packet from oneof the corresponding ingress ports, e.g., one of ingress ports 202 a 1and 202 a 2, to be sent to the ingress packet processor 202 a. If theingress arbiter 203 a receives packets from the ingress ports at ahigher rate, e.g., at 0.5 Gigahertz (GHz) rate, which corresponds totwice the number of packets per clock cycle compared to 250 MHz, thenthe ingress arbiter 203 a will drop or buffer any additional packetsreceived per clock cycle, instead of sending the additional packets tothe ingress packet processor 202 a.

A traffic shaper replenishes its bucket, e.g., refills the bucket withnew tokens, periodically. In some implementations, a traffic shaper,e.g., traffic shaper 210 a, refills its bucket after the expiry of arefresh time interval. The traffic shaper uses a timer that counts downto the refresh interval. The refresh interval can be set in terms ofclock cycles, e.g., 1500 clock cycles. When the timer expires, e.g.,after every 1500 clock cycles in the above example, the traffic shaperreplenishes the expended tokens in its bucket.

In some implementations, the refresh time interval is a programmablevalue. In such implementations, a user of the electronic device 200specifies a target packet processing rate for the device using auser-configurable device setting. Based on the user-specified targetprocessing rate, logic circuitry in the electronic device 200 sets therefresh time interval such that the traffic shapers in the electronicdevice 200 can supply packets to the respective ingress packetprocessors to achieve the target packet processing rate. Tokens areadded to the bucket according to a rate of N tokens every T clock cycles(where N and T are positive integers). One or more tokens are removedfrom the bucket on every transmission, depending on what is being shaped(e.g., packets, bytes, or cells). If nothing is transmitted (e.g., thedevice is idle), then the bucket will keep receiving tokens (andpotentially overflow). To contain the amount of tokens that can beaccumulated, a maximum burst size is used.

In some implementations, a traffic shaper is dynamically tuned based onone or more parameters. These parameters include, for example, monitoredpower usage of the electronic device 200, the number of cell dropsobserved in the electronic device 200 to regulate power consumption bythe electronic device 200, total buffer (or buffer partition) usage ofthe electronic device 200, length of a queue (e.g., in bytes or cells)corresponding to an egress port, maximum delay (e.g., instantaneous oraverage delay) across a queue or across a subset of queues, or asuitable combination of these parameters. As an example, if theswitching activity of the electronic device increases due to receivingdata at high rates, thereby leading to increased power consumption, thenthe number of tokens that are generated by a traffic shaper foradmitting cells is dynamically adjusted to limit the increase in powerconsumption by the device. In some implementations, the token generationrate by a traffic shaper, e.g. traffic shaper 210 a, is reduced. Thisresults in limiting the number of cells that are admitted for switchingby the corresponding ingress arbiter, e.g., ingress arbiter 203 a,leading to a reduction in power consumption. Dynamically managing thetoken generation by a traffic shaper can be achieved by adjusting therefresh time interval for the traffic shaper to refill its bucket. Therefresh time interval can be increased so that the traffic shaperrefills its bucket of tokens at a slower rate, and thus decreasing therate at which its tokens are available for admitting cells.

The token generation rate for a traffic shaper is correlated to thenumber of cells that are dropped by the corresponding ingress arbiter,e.g., when the arbiter's buffer is quickly filled up due to a lower rateat which the traffic shaper supplies tokens. By monitoring the number ofcell drops by the ingress arbiter, the token generation rate of thetraffic shaper can be fine tuned to limit the number of cell drops to beless than target limits for drops, while also limiting the powerconsumption by the electronic device to be within target power levels.The number of cell drops that are observed may be presented as aninstantaneous number, an average number, or a sum over a window of time.

Conversely, if the traffic shaping in the electronic device 200 iseffectively managed to an extent such that the power consumption by thedevice is well under preset power target levels, then the electronicdevice may be capable of increasing its switching activity withoutexceeding its power consumption targets. In some implementations, thenthe number of tokens that are generated by a traffic shaper, e.g.,traffic shaper 210 a, is dynamically increased to increase the rate atwhich cells are admitted by the corresponding ingress arbiter, e.g.,ingress arbiter 203 a, which leads to an increase in switching activityand thereby increases power consumption. The traffic generation rate ofa traffic shaper is managed to ensure that the rate of admittance ofcells is managed such that the total power consumption by the electronicdevice stays within target levels.

As noted above, in some implementations, each ingress arbiter furtherincludes, in addition to traffic shapers, input buffers. The inputbuffers are used to temporarily store incoming packets until tokens areavailable to forward the packets to the corresponding ingress packetprocessor, e.g., in cases where the incoming packet rate is greater thanthe processing rate of the ingress packet processor. In someimplementations, each input buffer includes components that can beimplemented using combinational logic circuitry, e.g., logic gates,flip-flops and registers. An input buffer also includes memorycomponents that can be implemented using memory chips or fabricated onone integrated circuit with the rest of the transmit buffer.

In the above manner, the electronic device 200 uses ingress arbiterswith traffic shapers, e.g., ingress arbiters 203 a, 203 b and 203 c withtraffic shapers 210 a, 210 b and 210 c respectively, to manage thepacket processing rate by the ingress packet processors in the device,thereby manage the power consumption by the electronic device. Thetraffic shapers are used in conjunction with, or as an alternative to,the threshold comparison techniques employed by the traffic managers,e.g., traffic managers 206 a, 206 b and 206 c, whose are similar tothose described above with respect to traffic managers 106 a, 106 b and106 c in the electronic device 100.

FIG. 3 illustrates an example process 300 employed by a traffic managerin an electronic device to store packet data in buffers. In someimplementations, the process 300 is performed by the electronic device100, e.g., executed by each traffic manager 106 a, 106 b and 106 c, andsimilarly, by each of the traffic managers 206 a, 206 b and 206 c, whenstoring packet data cells in their respective buffers. Accordingly, thefollowing sections describe the process 300 as being performed by thetraffic managers 106 a, 106 b and 106 c in the electronic device 100.However, in other implementations, the process 300 may be performed byother systems or devices.

The process 300 starts when a traffic manager receives information aboutpacket data sent by one or more ingress packet processors for storage byone or more traffic managers (302). For example, each of the trafficmanagers 106 a, 106 b and 106 c receive, from the crossbar 104, abroadcast of cells corresponding to packet data sent by one or more ofthe ingress packet processors 102 a, 102 b and 102 c, along with stateinformation about the cells.

Each traffic manager computes the total amount of packet data to bewritten to buffers in the one or more traffic managers that are part ofthe same device (304). For example, each of the traffic managers 106 a,106 b and 106 c, independently computes, using the state informationreceived in the broadcast from the crossbar 104, the total number ofcells that are to be written in all the buffers managed by the trafficmanagers, e.g., buffers 106 a 1, 106 a 2, 106 b 1, 106 b 2, 106 c 1 and106 c 2.

Each traffic manager checks whether the total amount of packet data isequal to or greater than one or more predetermined threshold levels(306). For example, each of the traffic managers 106 a, 106 b and 106 cindependently compares the total number of cells, to one or morethreshold levels. In some implementations, each traffic manager comparesthe total number of cells to be written to a single threshold level.This is the case, for example, when traffic of a single type, e.g.,unicast packets, is received. In some implementations, each trafficmanager compares the total number of cells to be written to multiplethreshold levels. This is the case, for example, when traffic ofdifferent types, e.g., unicast, multicast, CPU traffic and mirroredtraffic, are received. In such implementations, different types oftraffic can have different priorities, with traffic having lowerpriority, e.g., multicast traffic, being dropped before traffic havinghigher priority, e.g., unicast traffic. In such cases, a threshold levelwith a lower value can correspond to a traffic type having a lowerpriority, while a threshold level with a higher value can correspond toa traffic type having a higher priority. A traffic manager compares thetotal amount of data to a threshold level with lower value beforecomparing the total amount of data to a threshold level with highervalue.

If a traffic manager determines that the total amount of packet data isless than the one or more threshold levels, then the traffic managerstores all cells of packet data in the corresponding buffers (308). Forexample, upon determining that the total number of cells computed isless than the one or more threshold levels, the traffic manager 106 aidentifies which of these cells are destined for egress ports coupled tothe traffic manager, e.g., one or more of egress ports 108 a 1 and 108 a2. The traffic manager 106 a stores the identified cells in the bufferscorresponding to the egress ports. For example, the traffic manager 106a stores cells of packets destined for egress port 108 a 1 in buffer 106a 1, and stores cells of packets destined for egress port 108 a 2 inbuffer 106 a 2. The traffic manager 106 a drops any remaining cells thatare destined for egress ports managed by the other traffic managers,e.g., any of egress ports 108 b 1, 108 b 2, 108 c 1 and 108 c 2.Similarly, the traffic manager 106 b identifies cells of packetsdestined for egress ports 108 b 1 or 108 b 2, or both, and stores thesecells in respective buffers 106 b 1 or 106 b 2, while discarding cellsof packets destined for egress ports managed by other traffic managers;and the traffic manager 106 c identifies cells of packets destined foregress ports 108 c 1 or 108 c 2, or both, and stores these cells inrespective buffers 106 c 1 or 106 c 2, while discarding cells of packetsdestined for egress ports managed by other traffic managers.

As noted previously, in some implementations, cells that arrive in agiven clock cycle and are stored in buffers, are portions of differentpackets. Even if a cell from a given packet is stored in a buffer in aclock cycle, not all cells for that packet may be stored. For example,another cell for that packet that arrives in a future clock cycle may bedropped, such that a traffic manager will subsequently drop all priorcells associated with the packet that have been stored in thecorresponding buffer, and all remaining cells of the packet that arrivein future clock cycles.

On the other hand, if a traffic manager determines that the total amountof packet data is equal to or greater than the one or more thresholdlevels, then the traffic manager drops some cells of packet datareceived for storage, and stores remaining cells of packet data in thecorresponding buffers (310). For example, upon determining that thetotal number of cells computed at 306 is equal to or greater than theone or more threshold levels, the traffic manager 106 a identifies whichof these cells are destined for egress ports coupled to the trafficmanager, e.g., one or more of egress ports 108 a 1 and 108 a 2. Thetraffic manager drops some of the identified cells, while storing theremaining cells destined for the egress ports coupled to the trafficmanager, in the respective buffers corresponding to the egress ports.The traffic manager also discards the cells that are destined for egressports managed by other traffic managers. Similarly, each of trafficmanagers 106 b and 106 c identifies which of the total number cells aredestined for egress ports coupled to the respective traffic manager, anddrops some of the identified cells, while storing the remaining cellsdestined for the respective managed egress ports, in the respectivebuffers corresponding to the egress ports, and also discards the cellsthat are destined for egress ports managed by other traffic managers.

In some implementations, each traffic manager first drops cells of lowerpriority traffic types, e.g., multicast traffic, e.g., when the totalnumber of cells equals or exceeds a lower threshold level, but is lessthan a higher threshold level. If the total number of cells equals orexceeds one or more higher threshold levels, then the traffic managerprogressively drops cells of higher priority traffic types, e.g., CPUtraffic, followed by unicast traffic.

In some implementations, the number of cells that are dropped is equalto the difference between the total number of cells in the current clockcycle, or a future clock cycle, and the threshold value that is equaledor exceeded. In some implementations, all cells of a particular traffictype are dropped, irrespective of the difference between the totalnumber of cells in the current clock cycle or the future clock cycle andthe threshold value that is equaled or exceeded. The determination ofwhich cells are to be dropped are made in the same clock cycle that thecells are to be written, or a clock cycle that occurs prior to the clockcycle when the cells are to be written.

In some implementations, if a traffic manager determines that the totalamount of packet data is equal to or greater than the one or morethreshold levels, then the traffic manager drops all cells for allpackets arriving at that time. In such cases, the traffic manager alsodrops previously stored cells for these packets, as well as additionalcells for these packets that are received in subsequent clock cycles.

FIG. 4 illustrates an example process 400 used by an ingress arbiter tomanage the rate of traffic fed to an ingress packet processor. In someimplementations, the process 400 is independently performed by each ofthe ingress arbiters 203 a, 203 b and 203 c, when forwarding packetsreceived from ingress ports to the corresponding ingress packetprocessors, e.g., ingress packet processors 202 a, 202 b and 202 c,respectively, that are connected to the ingress arbiters. In someimplementations, the ingress arbiters 203 a, 203 b and 203 c perform theprocess 400 using information from their respective traffic shapers 210a, 210 b and 210 c. Accordingly, the following sections describe theprocess 400 with respect to the ingress arbiters 203 a, 203 b and 203 cand their corresponding traffic shapers 210 a, 210 b and 210 c,respectively. However, in other implementations, the process 400 may beperformed by other systems or devices.

In some implementations, the process 400 is performed in conjunctionwith the process 300, which is independently performed by each of thetraffic managers 206 a, 206 b and 206 c, as described above.

The process 400 starts when an ingress arbiter determines a traffic rateat which a connected ingress packet processor is configured to transmitcell of packets to the traffic managers (402). For example, the ingressarbiter 203 a determines the packet processing rate used by the ingresspacket processor 202 a that is connected to the ingress arbiter 203 a.Similarly, the ingress arbiter 203 b determines the packet processingrate used by the ingress packet processor 202 b that is connected to theingress arbiter 203 b; and the ingress arbiter 203 c determines thepacket processing rate used by the ingress packet processor 202 c thatis connected to the ingress arbiter 203 c. In some implementations, thepacket processing rate used by the different ingress packet processorsis the same. In other implementations, different packet processing ratesare used by different ingress packet processors. As disclosed above, insome implementations, the packet processing rate used by an ingresspacket processor is configured in the device using a configuration valuespecified by a user of the electronic device 200.

An ingress arbiter assigns a number of tokens to manage a group ofingress ports that sends packets to the ingress packet processor throughthe ingress arbiter (404). For example, upon determining the packetprocessing rate for the ingress packet processor 202 a, the ingressarbiter 203 a computes a number of tokens that would enable packetprocessing by the ingress packet processor 202 a at the desired rate.The ingress arbiter 203 a assigns the computed number of tokens to thetraffic shaper 210 a, to manage the input data rate for the group ofingress ports including ingress ports 202 a 1 and 202 a 2 that areconnected to the ingress arbiter 203 a. Similarly, the ingress arbiter203 b computes a number of tokens corresponding to the target packetprocessing rate by the ingress packet processor 202 b, assigns thecomputed number of tokens to the traffic shaper 210 b, to manage theinput data rate for the group of ingress ports including ingress ports202 b 1 and 202 b 2 that are connected to the ingress arbiter 203 b; andthe ingress arbiter 203 c computes a number of tokens corresponding tothe target packet processing rate by the ingress packet processor 202 c,and assigns the computed number of tokens to the traffic shaper 210 c,to manage the input data rate for the group of ingress ports includingingress ports 202 c 1 and 202 c 2 that are connected to the ingressarbiter 203 c.

In some implementations, the tokens are implemented using an up-downcounter. In some implementations, the number of tokens assigned to eachtraffic shaper is the same across all traffic shapers. This is the case,for example, when the packet processing rate for all the ingress packetprocessors is the same. However, in other implementations, differenttraffic shapers are assigned different numbers of tokens to thecorresponding ingress port groups, e.g., when the ingress packetprocessors have different packet processing rates.

An ingress arbiter receives a cell of packet from an ingress port in thegroup of ingress ports (405). For example, the ingress arbiter 203 areceives cells corresponding to packets from one or more of the ingressports that are connected to the ingress arbiter 203 a, e.g., ingressports 202 a 1 or 202 a 2, or both. Similarly, the ingress arbiter 203 breceives cells corresponding to packets from one or more of the ingressports that are connected to the ingress arbiter 203 b, e.g., ingressports 202 b 1 or 202 b 2, or both; and the ingress arbiter 203 creceives cells corresponding to packets from one or more of the ingressports that are connected to the ingress arbiter 203 c, e.g., ingressports 202 c 1 or 202 c 2, or both.

Upon receiving a cell of a packet, an ingress arbiter checks if space isavailable in an input buffer associated with the ingress arbiter (406).For example, ingress arbiter 203 a checks to see if there is space in aninput buffer to store cells that are received.

If space is available in the input buffer, then the ingress arbitertemporarily stores the packet in an input buffer (410). For example,upon receiving a cell of packet for sending to the ingress packetprocessor 202 a, the ingress arbiter 203 a stores the cell in an inputbuffer that is maintained in the ingress arbiter if there is spaceavailable in the input buffer to store the cell. Similarly, if theingress arbiters 203 b and 203 c store their respective cells in inputbuffers if there is space available in the input buffers.

Drops occur when there is no space available in the input buffer tostore the cells before they are eligible to be forwarded. If an ingressarbiter determines that space is not available in the correspondinginput buffer, then the ingress arbiter arranges to drop the cell that isreceived (408). For example, upon receiving a cell, ingress arbiter 203a checks whether there is space available in a corresponding inputbuffer to store the newly received cell, until a token becomesavailable. If there is no space available in the buffer, e.g., thebuffer is full with waiting cells, then the ingress arbiter 203 aarranges to drop the cell. In some implementations, the ingress arbiter203 a marks the cell as an EOP cell with a drop notation and forwardsthe cell to the ingress packet processor 202 a. In some implementations,for the ingress buffer, an empty cell (e.g., with no data) with an EOPindicator is linked to the queue, since there is no buffer to store thecell. On de-queue, the empty cell is converted to a single byte cellwith error and EOP indication. When an EOP cell with a drop notation isreceived by the traffic manager, e.g., traffic manager 206 a, thetraffic manager drops the cell, along with all other cells of the packetthat are stored in the traffic manager buffers for store-and forwardpackets. If the packet is being transmitted as a cut-through packet,then the EOP cell is forward along with an error flag. The error flagforces the packet to depart the device with a bad checksum (e.g., cyclicredundancy check or CRC), such that the downstream device that receivesthe packet knows the packet has been corrupted and therefore drops thepacket. Further, any additional cells of the packet that are received insubsequent time intervals (e.g., clock cycles) are also dropped. Such acircumstance arises when the rate at which cells are received from theingress ports, e.g. ingress port 202 a 1 or 202 a 2, is significantlygreater than the rate at which cells are processed by the ingress packetprocessors, e.g., ingress packet processor 202 a, such that input bufferin the ingress arbiter gets filled with waiting cells. In a similarmanner, ingress arbiters 203 b and 203 c store arriving cells in therespective input buffers, or arrange to drop the cells if the respectiveinput buffers are full.

After storing a cell in an input buffer, an ingress arbiter checks withits traffic shaper whether a token is available (411). For example, toforward cells received from connected ingress ports that are stored inrespective buffers, each of the ingress arbiters 203 a, 203 b and 203 cquery the respective traffic shapers 210 a, 210 b and 210 c to check ifthere are tokens available in the respective bucket of tokens assignedto the respective group of connected ingress ports.

If a token is available, then an ingress arbiter forwards the cell tothe ingress packet processor and decrements the count of availabletokens by one (414). For example, if the ingress arbiter 203 adetermines, using information from the traffic shaper 210 a, that atoken is available in the bucket of tokens assigned to the ingress portgroup including ingress ports 202 a 1 and 202 a 2, then the ingressarbiter 203 a forwards a cell of a packet, which is stored in a bufferof the ingress arbiter, to the ingress packet processor 202 a, andconsumes a token from its bucket. Similarly, if the ingress arbiter 203b determines, using information from the traffic shaper 210 b, that atoken is available in the bucket of tokens assigned to the ingress portgroup including ingress ports 202 b 1 and 202 b 2, then the ingressarbiter 203 b forwards a cell of a packet to the ingress packetprocessor 202 b, consuming a token from its bucket; and if the ingressarbiter 203 c determines, using information from the traffic shaper 210c, that a token is available in the bucket of tokens assigned to theingress port group including ingress ports 202 c 1 and 202 c 2, then theingress arbiter 203 c forwards a cell of a packet to the ingress packetprocessor 202 c, consuming a token from its bucket.

In some implementations a cell becomes eligible for forwarding to theingress packet processor if a token is available. In such cases, thecell may not be immediately forwarded by an ingress arbiter to thepacket processor for other reasons, e.g., for flow control. In someimplementations, a traffic shaper consumes a token from its bucket onlyupon forwarding an initial cell of a packet, which is also referred toas a start of packet (SOP) cell. In such cases, for other cells ofpacket, e.g., middle of packet (MOP) cells or end of packet (EOP) cells,no tokens are consumed. This ensures that one token is consumed perpacket, and not more.

On the other hand, if a token is not available, then the cells remainstored in a buffer of an ingress arbiter until a token becomesavailable. In such cases, an ingress arbiter periodically checks whethera token refresh has occurred (412). For example, a traffic shaperincludes a token bucket counter that receives new tokens according to aknown period (e.g., periodically). The number of tokens to be added tothe bucket is configured such that the tokens per period equal thetarget processing rate. If there are no tokens available, then any cellbuffered due to lack of tokens waits until the token bucket isrefreshed. The amount of tokens that can be accumulated may be limitedby a maximum burst size.

Each traffic shaper periodically checks whether the associated tokenrefresh has occurred. In some implementations, the token refresh is abackground process. A timer is maintained determine the time since thelast refresh. If the time since last refresh exceeds a configuredrefresh interval, then a token refresh is performed. If a token refreshoccurs, then the ingress arbiter makes the cells stored in the buffereligible for forwarding to the corresponding ingress packet processor(416). For example, when ingress arbiter 203 a determines that tokensare again available, the ingress arbiter 203 a resumes forwarding thecells that are stored in the corresponding input buffer, to the ingresspacket processor 202 a.

In some implementations, if a token is not available (411), then aningress arbiter immediately arranges to drop the cell, instead ofleaving the cell stored in an input buffer until a token refresh occurs.In such cases, the ingress arbiter changes the cell to an EOP cell andforwards to its ingress packet processor with a drop notation, such thatthe cell is dropped, along with other stored cells belonging to the samepacket, when it reaches the traffic manager. In such implementations,the operation of checking for token refresh is optional.

The processes and logic flows described in this document can beperformed by one or more programmable processors executing one or morecomputer programs to perform the functions described herein. Theprocesses and logic flows can also be performed by, and apparatus canalso be implemented as, special purpose logic circuitry, e.g., a fieldprogrammable gate array (FPGA) or an ASIC.

Processors suitable for the execution of a computer program include, byway of example, both general and special purpose microprocessors, andany one or more processors of any kind of digital computer. Generally, aprocessor will receive instructions and data from a read only memory ora random access memory or both.

While this document may describe many specifics, these should not beconstrued as limitations on the scope of an invention that is claimed orof what may be claimed, but rather as descriptions of features specificto particular embodiments. Certain features that are described in thisdocument in the context of separate embodiments can also be implementedin combination in a single embodiment. Conversely, various features thatare described in the context of a single embodiment can also beimplemented in multiple embodiments separately or in any suitablesub-combination. Moreover, although features may be described above asacting in certain combinations and even initially claimed as such, oneor more features from a claimed combination in some cases can be excisedfrom the combination, and the claimed combination may be directed to asub-combination or a variation of a sub-combination. Similarly, whileoperations are depicted in the drawings in a particular order, thisshould not be understood as requiring that such operations be performedin the particular order shown or in sequential order, or that allillustrated operations be performed, to achieve desirable results.

Only a few examples and implementations are disclosed. Variations,modifications, and enhancements to the described examples andimplementations and other implementations can be made based on what isdisclosed.

What is claimed is:
 1. A device, comprising: a plurality of ingresspacket processors, each receiving packets from one or more ports; acrossbar that couples the plurality of ingress packet processors to aplurality of traffic managers, wherein the crossbar is configured to:receive, from one or more of the plurality of ingress packet processors,packet data to be sent to one or more of the plurality of trafficmanagers, wherein the packet data corresponds to one or more packets;and transmit information about the packet data to the plurality oftraffic managers; and the plurality of traffic managers, each managingone or more buffers that store packet data for forwarding to packetqueues corresponding to egress packet processors, wherein each trafficmanager in the plurality of traffic managers is configured to: receive,from the crossbar, the information about the packet data destined forthe plurality of traffic managers; compute a total amount of packet datato be written to the buffers in the plurality of traffic managers;compare the total amount of packet data to one or more threshold values,including a first threshold value corresponding to a first traffic typeand a second threshold value corresponding to a second traffic type, thesecond threshold value being greater than the first threshold value;conditioned on determining that the total amount of packet data to bewritten is equal to or greater than the first threshold value but lessthan the second threshold value: upon receiving the packet data, droppacket data of the first traffic type, and write packet data of othertraffic types, including the second traffic type, to the buffers managedby the traffic manager; and conditioned on determining that the totalamount of packet data to be written is equal to or greater than thesecond threshold value: upon receiving the packet data, drop packet dataof the first and second traffic types, and write packet data of othertraffic types to the buffers managed by the traffic manager.
 2. Thedevice of claim 1, wherein each buffer includes a first buffer forstoring packet data of the first traffic type, a second buffer forstoring packet data of the second traffic type, and a third buffer forstoring packet data of other traffic types, with a priority value forthe second buffer being higher than a priority value for the firstbuffer, and a priority value for the third buffer being higher than thepriority value for the second buffer, wherein the traffic manager isconfigured to: conditioned on determining that the total amount ofpacket data to be written is equal to or greater than the firstthreshold value but less than the second threshold value: upon receivingthe packet data, drop packet data to be stored in the first buffer, andwrite packet data to be stored in the second and third buffers; andconditioned on determining that the total amount of packet data to bewritten is equal to or greater than the second threshold value: uponreceiving the packet data, drop packet data to be stored in the firstand second buffers, and write packet data to be stored in the thirdbuffer.
 3. The device of claim 1, wherein the one or more thresholdvalues are configured to limit an amount of power consumed by the devicefor writing packet data to the buffers.
 4. The device of claim 1,wherein transmitting, to the plurality of traffic managers, theinformation about the packet data comprises: delaying, at each trafficmanager, viewing of the information received from the crossbar by a timeperiod that is set to a value such that all traffic managers obtain acoordinated view of the information.
 5. The device of claim 4, whereinthe value of the time period corresponding to a particular trafficmanager depends on the largest transmission delay of all transmissiondelays from the plurality of ingress packet processors to the pluralityof traffic managers.
 6. The device of claim 4, wherein delaying theviewing of the information at each traffic manager comprises: analyzing,by each traffic manager, physical transmission delays in the device; andin response to the analyzing, determining, by each traffic manager, thetime period for the delay at the traffic manager to address the physicaltransmission delays in the device.
 7. The device of claim 1, whereininformation about the packet data received from the crossbar includesstate information provided with each cell of packet data, each trafficmanager receiving state information for all cells, wherein uponreceiving state information for all cells, each traffic managerdetermines the total amount of packet data to be written across allbuffers.
 8. The device of claim 1, wherein dropping a portion of thepacket data and writing a remaining portion of the packet data to thebuffers comprise: determine, from the information about the packet data,traffic types of the packet data destined for the traffic manager;conditioned on determining that the total amount of packet data is equalto or greater than the one or more threshold values: select traffictypes to be dropped from the packet data to be received; upon receivingthe packet data, drop packet data of the selected traffic types; andwrite packet data of other traffic types to the buffers managed by thetraffic manager.
 9. The device of claim 1, wherein the device includes anetwork switch.
 10. A method performed by a traffic manager of aplurality of traffic managers in a device, the method comprising:receiving, from a crossbar of the device, information about packet datadestined for the plurality of traffic managers, wherein the crossbarreceives the packet data from one or more of a plurality of ingresspacket processors, the crossbar coupling the plurality of ingress packetprocessors to the plurality of traffic managers, wherein each of theplurality of ingress packet processors receives packets from one or moreports, and the packet data corresponds to one or more packets; computinga total amount of packet data to be written to buffers in the pluralityof traffic managers, wherein each traffic manager of the plurality oftraffic managers manages one or more buffers that store packet data forforwarding to packet queues corresponding to egress packet processors;comparing the total amount of packet data to one or more thresholdvalues, including a first threshold value corresponding to a firsttraffic type and a second threshold value corresponding to a secondtraffic type, the second threshold value being greater than the firstthreshold value; conditioned on determining that the total amount ofpacket data is equal to or greater than the first threshold value butless than the second threshold value: upon receiving the packet data,dropping packet data of the first traffic type, and writing packet dataof other traffic types, including the second traffic type, to thebuffers managed by the traffic manager; and conditioned on determiningthat the total amount of packet data to be written is equal to orgreater than the second threshold value: upon receiving the packet data,dropping packet data of the first and second traffic types, and writingpacket data of other traffic types to the buffers managed by the trafficmanager.
 11. The method of claim 10, wherein each buffer includes afirst buffer for storing packet data of the first traffic type, a secondbuffer for storing packet data of the second traffic type, and a thirdbuffer for storing packet data of other traffic types, with a priorityvalue for the second buffer being higher than a priority value for thefirst buffer, and a priority value for the third buffer being higherthan the priority value for the second buffer, the method furthercomprising: conditioned on determining that the total amount of packetdata to be written is equal to or greater than the first threshold valuebut less than the second threshold value: upon receiving the packetdata, dropping packet data to be stored in the first buffer, and writingpacket data to be stored in the second and third buffers; andconditioned on determining that the total amount of packet data to bewritten is equal to or greater than the second threshold value: uponreceiving the packet data, dropping packet data to be stored in thefirst and second buffers, and writing packet data to be stored in thethird buffer.
 12. The method of claim 10, wherein the one or morethreshold values are selected to limit an amount of power consumed bythe device for writing packet data to the buffers.
 13. The method ofclaim 10, wherein receiving the information about the packet data fromthe crossbar comprises: delaying, at the traffic manager, viewing of theinformation received from the crossbar by a time period that is set to avalue such that all traffic managers obtain a coordinated view of theinformation.
 14. The method of claim 13, wherein the value of the timeperiod corresponding to the traffic manager depends on the largesttransmission delay of all transmission delays from the plurality ofingress packet processors to the plurality of traffic managers.
 15. Themethod of claim 13, wherein delaying the viewing of the information atthe traffic manager comprises: analyzing, by the traffic manager,physical transmission delays in the device; and in response to theanalyzing, determining, by the traffic manager, the time period for thedelay at the traffic manager to address the physical transmission delaysin the device.
 16. The method of claim 10, wherein information about thepacket data received from the crossbar includes state informationprovided with each cell of packet data, each traffic manager receivingstate information for all cells, the method further comprising: uponreceiving state information for all cells, determining the total amountof packet data to be written across all buffers.
 17. The method of claim10, wherein dropping a portion of the packet data and writing aremaining portion of the packet data to the buffers comprise:determining, from the information about the packet data, traffic typesof the packet data destined for the traffic manager; conditioned ondetermining that the total amount of packet data is equal to or greaterthan the one or more threshold values: selecting traffic types to bedropped from the packet data to be received; upon receiving the packetdata, dropping packet data of the selected traffic types; and writingpacket data of other traffic types to the buffers managed by the trafficmanager.
 18. The method of claim 10, wherein the device includes anetwork switch.
 19. A device, comprising: a plurality of ingress packetprocessors, each receiving packets from one or more ports; a crossbarthat couples the plurality of ingress packet processors to a pluralityof traffic managers, wherein the crossbar is configured to: receive,from one or more of the plurality of ingress packet processors, packetdata to be sent to one or more of the plurality of traffic managers,wherein the packet data corresponds to one or more packets; and transmitinformation about the packet data to the plurality of traffic managers;and the plurality of traffic managers, each managing one or more buffersthat store packet data for forwarding to packet queues corresponding toegress packet processors, wherein each traffic manager in the pluralityof traffic managers is configured to: receive, from the crossbar, theinformation about the packet data destined for the plurality of trafficmanagers; delay viewing of the information received from the crossbar bya time period that is set to a value such that all traffic managersobtain a coordinated view of the information; compute a total amount ofpacket data to be written to the buffers in the plurality of trafficmanagers; compare the total amount of packet data to one or morethreshold values; conditioned on determining that the total amount ofpacket data is equal to or greater than a threshold value of the one ormore threshold values: upon receiving the packet data, drop a portion ofthe packet data; and write a remaining portion of the packet data to thebuffers managed by the traffic manager.
 20. A device, comprising: aplurality of ingress packet processors, each receiving packets from oneor more ports; a crossbar that couples the plurality of ingress packetprocessors to a plurality of traffic managers, wherein the crossbar isconfigured to: receive, from one or more of the plurality of ingresspacket processors, packet data to be sent to one or more of theplurality of traffic managers, wherein the packet data corresponds toone or more packets; and transmit information about the packet data tothe plurality of traffic managers; and the plurality of trafficmanagers, each managing one or more buffers that store packet data forforwarding to packet queues corresponding to egress packet processors,wherein each traffic manager in the plurality of traffic managers isconfigured to: receive, from the crossbar, the information about thepacket data destined for the plurality of traffic managers, theinformation about the packet data including state information providedwith each cell of packet data, each traffic manager receiving stateinformation for all cells; upon receiving state information for allcells, compute a total amount of packet data to be written across allbuffers in the plurality of traffic managers; compare the total amountof packet data to one or more threshold values; conditioned ondetermining that the total amount of packet data is equal to or greaterthan a threshold value of the one or more threshold values: uponreceiving the packet data, drop a portion of the packet data; and writea remaining portion of the packet data to the buffers managed by thetraffic manager.
 21. A method performed by a traffic manager of aplurality of traffic managers in a device, the method comprising:receiving, from a crossbar of the device, information about packet datadestined for the plurality of traffic managers, wherein the crossbarreceives the packet data from one or more of a plurality of ingresspacket processors, the crossbar coupling the plurality of ingress packetprocessors to the plurality of traffic managers, wherein each of theplurality of ingress packet processors receives packets from one or moreports, and the packet data corresponds to one or more packets; delayingviewing of the information received from the crossbar by a time periodthat is set to a value such that all traffic managers obtain acoordinated view of the information; computing a total amount of packetdata to be written to buffers in the plurality of traffic managers,wherein each traffic manager of the plurality of traffic managersmanages one or more buffers that store packet data for forwarding topacket queues corresponding to egress packet processors; comparing thetotal amount of packet data to one or more threshold values; conditionedon determining that the total amount of packet data is equal to orgreater than a threshold value of the one or more threshold values: uponreceiving the packet data, dropping a portion of the packet data; andwriting a remaining portion of the packet data to the buffers managed bythe traffic manager.
 22. A method performed by a traffic manager of aplurality of traffic managers in a device, the method comprising:receiving, from a crossbar of the device, information about packet datadestined for the plurality of traffic managers, the information aboutthe packet data including state information provided with each cell ofpacket data, each traffic manager receiving state information for allcells, wherein the crossbar receives the packet data from one or more ofa plurality of ingress packet processors, the crossbar coupling theplurality of ingress packet processors to the plurality of trafficmanagers, wherein each of the plurality of ingress packet processorsreceives packets from one or more ports, and the packet data correspondsto one or more packets; upon receiving state information for all cells,computing a total amount of packet data to be written across all buffersin the plurality of traffic managers, wherein each traffic manager ofthe plurality of traffic managers manages one or more buffers that storepacket data for forwarding to packet queues corresponding to egresspacket processors; comparing the total amount of packet data to one ormore threshold values; conditioned on determining that the total amountof packet data is equal to or greater than a threshold value of the oneor more threshold values: upon receiving the packet data, dropping aportion of the packet data; and writing a remaining portion of thepacket data to the buffers managed by the traffic manager.